Achieving All with No Parameters: Adaptive NormalHedge
نویسندگان
چکیده
We study the classic online learning problem of predicting with expert advice, and propose a truly parameter-free and adaptive algorithm that achieves several objectives simultaneously without using any prior information. The main component of this work is an improved version of the NormalHedge.DT algorithm [Luo and Schapire, 2014], called AdaNormalHedge. On one hand, this new algorithm ensures small regret when the competitor has small loss and almost constant regret when the losses are stochastic. On the other hand, the algorithm is able to compete with any convex combination of the experts simultaneously, with a regret in terms of the relative entropy of the prior and the competitor. This resolves an open problem proposed by Chaudhuri et al. [2009] and Chernov and Vovk [2010]. Moreover, we extend the results to the sleeping expert setting and provide two applications to illustrate the power of AdaNormalHedge: 1) competing with time-varying unknown competitors and 2) predicting almost as well as the best pruning tree. Our results on these applications significantly improve previous work from different aspects, and a special case of the first application resolves another open problem proposed by Warmuth and Koolen [2014] on whether one can simultaneously achieve optimal shifting regret for both adversarial and stochastic losses.
منابع مشابه
Distributed Fuzzy Adaptive Sliding Mode Formation for Nonlinear Multi-quadrotor Systems
This paper suggests a decentralized adaptive sliding mode formation procedure for affine nonlinear multi-quadrotor under a fixed directed topology wherever the followers are conquered by dynamical uncertainties. Compared with the previous studies which primarily concentrated on linear single-input single-output (SISO) agents or nonlinear agents with constant control gain, the proposed method is...
متن کاملRobust Adaptive Fuzzy Sliding Mode Control of Permanent Magnet Stepper Motor with Unknown Parameters and Load Torque
In this paper, robust adaptive fuzzy sliding mode control is designed to control the Permanent Magnet (PM) stepper motor in the presence of model uncertainties and disturbances. In doing so, the nonlinear model is converted to canonical form, then, for designing the controller, the robust sliding mode control is designed to decrease the effects of uncertainties and disturbances. A class of fuzz...
متن کاملA Probe into Adaptive Transfer across Writing Contexts: A Case of an EGAP Class
In an effort to expand the disciplinary discussions on transfer in L2 writing and because most studies have focused on transfer as reuse and not as an adequate adaptation of writing knowledge in new contexts, the present study as the first of its kind aimed to explore the issue of adaptive transfer in an English for General Academic Purposes (EGAP) writing course. The study thus focused on type...
متن کاملAnalysis of Speed Control in DC Motor Drive Based on Model Reference Adaptive Control
This paper presents fuzzy and conventional performance of model reference adaptive control(MRAC) to control a DC drive. The aims of this work are achieving better match of motor speed with reference speed, decrease of noises under load changes and disturbances, and increase of system stability. The operation of nonadaptive control and the model reference of fuzzy and conventional adaptive contr...
متن کاملPrediction with Advice of Unknown Number of Experts
In the framework of prediction with expert advice, we consider a recently introduced kind of regret bounds: the bounds that depend on the effective instead of nominal number of experts. In contrast to the NormalHedge bound, which mainly depends on the effective number of experts but also weakly depends on the nominal one, we obtain a bound that does not contain the nominal number of experts at ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1502.05934 شماره
صفحات -
تاریخ انتشار 2015